Databricks Reviews & Product Details

Databricks is the Data and AI company. More than 20,000 organizations worldwide — including adidas, AT&T, Bayer, Block, Mastercard, Rivian, Unilever, and over 60% of the Fortune 500 — rely on Databricks to build and scale data and AI apps, analytics and agents. Headquartered in San Francisco with 30+ offices around the globe, Databricks offers a unified Data Intelligence Platform that includes Agent Bricks, Lakeflow, Lakehouse, Lakebase and Unity Catalog.

Product Website

Seller

Databricks Inc.

Discussions

Databricks Community

Languages Supported

German, English, French, Italian, Japanese, Korean, Portuguese

Overview by

Colette Chavalia

Value at a Glance

Averages based on real user reviews.

Time to Implement

4 months

View More Pricing Information

Databricks Media

Databricks Demo - Automated ETL processing

Once ingested, raw data needs transforming so that it’s ready for analytics and AI. Databricks provides powerful ETL capabilities for data engineers, data scientists and analysts with Delta Live Tables (DLT).

Databricks Demo - Reliable workflow orchestration

Databricks Workflows is the fully managed orchestration service for all your data, analytics and AI that is native to your Lakehouse Platform. Orchestrate diverse workloads for the full lifecycle including Delta Live Tables and Jobs for SQL, Spark, notebooks, dbt, ML models and more.

Databricks Demo - End-to-end observability and monitoring

The Lakehouse Platform gives you visibility across the entire data and AI lifecycle so data engineers and operations teams can see the health of their production workflows in real time, manage data quality and understand historical trends. In Databricks Workflows you can access dataflow graphs an...

Databricks Demo - Security and governance at scale

Delta Lake reduces risk by enabling fine-grained access controls for data governance, functionality typically not possible with data lakes.

Databricks Demo - Automated and trusted data engineering

Simplify data engineering with Delta Live Tables – an easy way to build and manage data pipelines for fresh, high-quality data on Delta Lake.

Databricks Demo - Eliminate resource management with serverless compute

Databricks SQL serverless removes the need to manage, configure or scale cloud infrastructure on the Lakehouse, freeing up your data team for what they do best.

G2 reviews are authentic and verified.

Here's how.

Neeraj Kumar N.

AI Data Specialist | Transcription & Annotation Expert | AI Model Training at Sigma AI

Mid-Market (51-1000 emp.)

4/12/2026

"Unified Databricks Workspace That Streamlines Collaboration and Complex Data Workflows"

4/5

What do you like best about Databricks?

What I like best about Databricks is how it brings data engineering, analytics, and machine learning into one unified workspace. I find collaboration much easier with shared notebooks, and the seamless integration with big data tools saves me time. It simplifies complex workflows while still offering powerful capabilities when I need them. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

One thing I dislike about Databricks is that it can feel expensive, especially for smaller projects or teams. I also find cluster configuration and cost management a bit complex at times. The interface, while powerful, can be overwhelming for beginners, and debugging distributed jobs isn’t always as straightforward as I’d like. Review collected by and hosted on G2.com.

What problems is Databricks solving and how is that benefiting you?

Databricks solves the challenge of handling large-scale data processing, analytics, and machine learning in one place. For me, it removes the hassle of managing separate tools and infrastructure. I benefit by working more efficiently, collaborating easily with my team, and turning complex data into useful insights faster, with less operational overhead overall. Review collected by and hosted on G2.com.

KARUPPUSAMY V.

Software Engineer

Mid-Market (51-1000 emp.)

4/11/2026

"Databricks Unifies Data, Analytics, and ML Into One Powerful Lakehouse"

5/5

What do you like best about Databricks?

The thing I genuinely appreciate most about Databricks is how it brings everything under one roof. Before Databricks, I was juggling separate tools for data ingestion, transformation, analytics, and ML, and honestly, it was a nightmare keeping them all in sync. With the Lakehouse architecture, I can store my raw data, run heavy Spark transformations, build dashboards, and even train ML models all in the same platform. Unity Catalog on top of that gives me a single place to manage permissions, lineage, and data discovery, which used to take me days of stitching together custom scripts. It just removes so much friction from the day-to-day.

From a workflow standpoint, the notebook experience with real-time collaboration has been a game changer for my team. We used to pass around Python scripts over Slack and hope nobody overwrote each other's work now we just co-edit in the same notebook and see results instantly. The Spark Declarative Pipelines (what used to be called Delta Live Tables) let me define my entire ETL as simple SQL or Python declarations, and the platform handles retries, data quality checks, and lineage tracking automatically. I also didn't expect the Jobs orchestrator to be as flexible as it is I can wire up multi-task DAGs with dependencies, set schedules, and get alerts without needing Airflow or any external scheduler. That alone saved us weeks of infrastructure setup.

The AI and intelligence side of things has honestly surprised me the most. The AI/BI dashboards let non technical folks on my team ask questions in plain English and get actual visualizations back, which means fewer "hey can you pull this data for me" requests in my inbox. Model Serving makes deploying an ML model or an AI agent to a production endpoint almost trivially easy compared to rolling your own Flask app on Kubernetes. And the integrations are solid whether it's connecting to external tools through the Databricks SDK, using the REST API, or plugging into AI coding assistants through things like MCP servers, it plays nicely with whatever stack you're already running. The ROI has been clear for us: less time on plumbing, more time on actual data work. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

The pricing model is probably my biggest gripe with Databricks. It's based on DBUs (Databricks Units), and honestly, it can get confusing fast trying to figure out what's going to cost you what. You spin up a cluster for some quick testing, forget to shut it down over the weekend, and suddenly your bill looks like you were running a production workload for a Fortune 500 company. Even with autoscaling and serverless options, the costs can creep up in ways that are hard to predict, especially for smaller teams or startups that don't have a dedicated FinOps person watching the meter. I wish the pricing was more transparent and easier to estimate upfront without needing a spreadsheet and a prayer.

The learning curve is steeper than people let on. If you're coming from a traditional SQL background or you've only worked with simpler tools like basic ETL platforms, Databricks can feel overwhelming at first. There's a lot of concepts to wrap your head around workspaces, clusters, catalogs, schemas, warehouses, notebooks, jobs, pipelines and the documentation, while extensive, sometimes assumes you already know what you're looking for. Onboarding new team members takes longer than I'd like because there's no single guided path that says "start here, then go there." You kind of have to piece it together yourself or lean on someone who's already been through the pain.

Performance-wise, it's generally solid, but there are moments that test your patience. Cold starts on clusters can take a few minutes, which breaks your flow when you just want to run a quick query or test a small change. Serverless compute has improved this a lot, but it's not available for every workload type yet, and sometimes the serverless option has its own quirks with library compatibility. The notebook UI, while functional, can also feel sluggish when you're working with large outputs or long notebooks it's not as snappy as a local IDE. These aren't dealbreakers by any means, but they're the kind of paper cuts that add up over a long day of development. Review collected by and hosted on G2.com.

What problems is Databricks solving and how is that benefiting you?

The biggest problem Databricks solves for me is the "tool sprawl" nightmare that every data team eventually runs into. Before we adopted it, we had one tool for ingestion, another for transformation, a separate warehouse for analytics, yet another platform for ML experiments, and a bunch of glue scripts holding it all together with duct tape. Every time something broke, you'd spend half your day just figuring out which piece in the chain went sideways. Databricks basically collapsed all of that into a single Lakehouse platform where my raw data, curated tables, dashboards, and ML models all live together. That alone cut our incident response time dramatically because there's one place to look, one set of logs, and one lineage graph that tells you exactly where things went wrong.

The second major problem it tackles is the wall between data engineers and data scientists. In my previous setups, the engineers would build pipelines and dump data into a warehouse, then the data science team would export that data into their own environment, do their thing, and throw a model back over the fence for us to deploy. It was slow, error-prone, and nobody was ever working with the same version of the data. With Databricks, both teams work in the same workspace on the same datasets governed by Unity Catalog. The scientists can experiment in notebooks, register a model in MLflow, and I can pick it up and deploy it to a serving endpoint without any file transfers or format conversions. That back-and-forth that used to take weeks now happens in days, sometimes hours.

The benefit that honestly surprised me the most is how much time I got back for actual engineering work instead of babysitting infrastructure. Things like Spark Declarative Pipelines handle retry logic, schema enforcement, and data quality expectations out of the box stuff I used to write custom code for. The Jobs orchestrator replaced our Airflow instance, which means one less server to maintain and one less thing to patch and upgrade. Even the governance side got easier because Unity Catalog tracks who accessed what data and where it came from, which used to be a manual audit nightmare every quarter. All of that freed-up time means I can focus on building new pipelines and improving data quality instead of constantly firefighting operational issues. Review collected by and hosted on G2.com.

Chandhuru B.

Data Engineer

Information Technology and Services

Mid-Market (51-1000 emp.)

4/6/2026

"Reliable data platform with powerful pipeline support"

4.5/5

What do you like best about Databricks?

What I like best about Databricks is how it brings data engineering, analytics, and machine learning together in one clean workspace. It saves time, makes collaboration easier, and helps teams move faster with large data. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

What I dislike about Databricks is that Auto Loader can become frustrating when source data changes frequently, especially if column names or datatypes shift without warning.

For example, a field like customer_id may suddenly come in as cust_id, or a column that was previously a string may start arriving as an integer, which can cause schema drift and break downstream processing.

I also find it inconvenient when schema inference is not fully accurate, such as when nested JSON or semi-structured data is read incorrectly, because it then requires extra manual fixes and maintenance to keep pipelines running smoothly. Review collected by and hosted on G2.com.

What problems is Databricks solving and how is that benefiting you?

Databricks is solving the problem of building and managing data pipelines at scale without so much manual effort. It helps with reliable ingestion, schema evolution, and orchestration, so teams can process data faster and keep pipelines more stable even when source files change.

For me, that means less time spent fixing broken jobs and more time focusing on transforming and using the data. It also benefits me by making batch and streaming workflows easier to manage in one platform, which is especially useful when data keeps changing. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

We're glad to hear that you find Databricks to be a reliable platform for data engineering, analytics, and machine learning. We understand the frustration with Auto Loader when dealing with frequently changing source data. We are continuously working to improve schema inference accuracy and handling of nested JSON or semi-structured data to minimize manual fixes and maintenance for our users.

Banu Prakash M.

Data Engineer

Mid-Market (51-1000 emp.)

4/2/2026

"Databricks: Unified Platform for Data Processing and Analytics"

5/5

What do you like best about Databricks?

I like that Databricks brings everything into one place, making it unnecessary to use different tools for data processing, analytics, and pipeline work. It handles large data well, and we don't have to worry about managing clusters manually. Additionally, Databricks handles collaboration and experimentation well, making it easy to try out new things. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

In my point of view, the one area that can be improved is cost management. If clusters aren't monitored carefully, costs can increase faster than expected. One improvement that would help is better visibility into costs at a more detailed level. More built-in alerts or recommendations when costs start increasing unexpectedly would also be helpful. Review collected by and hosted on G2.com.

What problems is Databricks solving and how is that benefiting you?

Databricks helps us handle large datasets and build data pipelines. It simplifies data processing, transforming, and analysis using Spark and SQL, all in one place. It solves the problem of slow data processing spread across systems, managing infrastructure automatically and facilitating collaboration and experimentation. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

We're thrilled to hear that Databricks has been beneficial for handling large datasets and simplifying data processing and analysis for you. We appreciate your feedback on cost management and will explore ways to enhance cost visibility and provide better monitoring tools.

Supriya M.

Data Engineer

Mid-Market (51-1000 emp.)

3/31/2026

"A Reliable Workhorse for Data Engineering and Analytics"

5/5

What do you like best about Databricks?

The unified platform approach is what I appreciate most. Having notebooks, data engineering pipelines, ML workflows, and SQL analytics all in one place saves a ton of time instead of juggling multiple tools. The collaborative notebooks make it easy to share work with teammates, and the cluster management has gotten a lot smoother over time. Delta Lake integration is also a huge plus for keeping our data reliable and consistent. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

The cost can get out of hand pretty quickly if you're not careful with cluster sizing and uptime. It's not always obvious how to optimize spending, and the pricing model feels complex. The learning curve for new team members is also steeper than I'd like, especially for people who aren't already familiar with Spark. Sometimes the UI can feel sluggish when working with larger notebooks, and debugging job failures could be more straightforward. Review collected by and hosted on G2.com.

What problems is Databricks solving and how is that benefiting you?

Databricks helps me resolve complex ETL pipeline failures and persistent data quality issues in supply chain analytics by unifying batch and streaming processing from SAP systems with Delta Live Tables. It also removes a lot of the infrastructure management headaches thanks to auto-scaling clusters, so I can stay focused on writing code for multi-terabyte workloads instead of constantly worrying about cluster sizing.

For my manufacturing data projects, Databricks accelerates development cycles from weeks to days via collaborative notebooks and DLT pipelines, enabling faster Power BI reporting and stakeholder decisions. Unity Catalog centralizes governance across Azure and SAP sources, preventing schema drift that plagued prior Hive-based lakes. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

Thank you for highlighting the benefits of the unified platform approach and the time-saving features of Databricks. We understand your concerns about cost management and the learning curve, and we're continuously working to simplify our pricing model and improve the onboarding experience for new team members. It's great to hear how Databricks is helping you resolve complex ETL pipeline failures and accelerating development cycles for your manufacturing data projects.

Thoufeeq A.

DevOps Engineer

Mid-Market (51-1000 emp.)

4/2/2026

"All-in-One Powerhouse with Room for Pricing Clarity"

4.5/5

What do you like best about Databricks?

I like that Databricks is an all-in-one powerhouse where I can do multiple works in one place. It's powerful to manage data from multiple sources and have it in a single UC to manage permissions with row-level security. I also appreciate that I can create experiments, run multiple models, and select the best one from logs, which was difficult on other platforms. Once I learned the setup, it's been easy and comfy to work with. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

I find it difficult to use the calculator to determine CPU serving endpoint prices because the documentation doesn't explicitly explain this. It only mentions 1 concurrency equals 1 DBU on the Azure page, which isn't clear. The pricing calculator has a single option for serving endpoints, labeled as medium with four DBU, but lacks separate options for GPU or CPU and their concurrency, making it hard to understand how it works properly. Initially, I also felt it was very tough to learn Databricks and manage deployments of workspaces, although it became easier over time. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

Thank you for sharing your positive experience with Databricks. We understand your concerns about the pricing calculator and will take your feedback into consideration to improve the clarity of our documentation.

Vidhyadar R.

Data Engineer

Enterprise (> 1000 emp.)

4/1/2026

Business partner of the seller or seller's competitor, not included in G2 scores.

"Databricks Lakehouse Powerhouse with Unity Catalog and Fast Photon SQL"

4/5

What do you like best about Databricks?

I really value how the platform brings data lakes and warehouses together into one place. It makes managing data much easier, and the SQL performance is very fast thanks to the Photon engine. I also like the collaborative notebooks because they allow me to work with both SQL and Python seamlessly in a single environment. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

The cost can be high, and the DBU billing system is quite complex to track. I also found that there is a significant learning curve when it comes to Spark and configuring clusters. For smaller, quick tasks, the setup time and technical overhead can sometimes feel like a bit too much. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

We appreciate your feedback on the benefits of Databricks, such as the centralized data management and the ability to work with SQL and Python in a single environment. We understand your concerns about cost and the learning curve, and we're actively working to enhance the platform to better meet your needs.

Sivabalan A.

Data Engineer

Mid-Market (51-1000 emp.)

4/2/2026

"Unified Data Engineering, Science, and Analytics in One Collaborative Platform"

4.5/5

What do you like best about Databricks?

What I appreciate most about Databricks is its ability to unify data engineering, data science, and analytics on a single platform. The collaborative environment—especially the notebooks and integrated workflows—makes it much easier for teams with different skill levels to work together without constant context-switching.

Another highlight is the integration with popular tools and cloud services that are widely used in the market today, which makes it easier to move data between them. The performance monitoring and job scheduling features help maintain visibility over pipelines, and the Delta Lake support for reliable data management has also been very useful. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

Cost management is one area that could be improved. While Databricks offers autoscaling and flexible cluster options, it’s easy for resource usage to escalate unexpectedly, especially with large datasets and long-running jobs. Keeping costs predictable often requires careful oversight and a solid understanding of the platform’s pricing model.

Additionally, some of the more advanced features—such as fine-grained access controls and more complex job orchestration—can feel less intuitive. The documentation is extensive, but it occasionally leaves gaps that end up requiring trial and error. Review collected by and hosted on G2.com.

What problems is Databricks solving and how is that benefiting you?

Databricks addresses several key challenges in modern data workflows, particularly around scalability, data reliability, and collaborative analytics. One major problem it solves is managing and processing large-scale datasets efficiently. By leveraging Apache Spark’s distributed computing framework, Databricks enables parallelized ETL pipelines and large-scale data transformations that would be impractical on traditional infrastructure.

Another challenge is ensuring data consistency and reliability across pipelines. With Delta Lake, Databricks provides ACID-compliant storage, versioned tables, and schema enforcement, which reduces data errors and simplifies data governance. This is especially beneficial when multiple teams are working on different stages of data pipelines at the same time.

Databricks also helps solve the problem of fragmented workflows for data scientists and engineers. Its unified environment supports multiple languages (Python, SQL, R, Scala) and includes integrated machine learning with MLFlow, making it easier to collaborate and move from data preparation to analytics and ML in one place. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

It's great to hear how Databricks is helping address scalability, data reliability, and collaborative analytics challenges for your team. We appreciate your feedback on cost management and advanced feature usability. We are continuously working to improve our pricing transparency and enhance the user experience for all our features.

Vijayaramuprawin V.

Sr. Cloud and DevOps Engineer

Mid-Market (51-1000 emp.)

4/1/2026

Business partner of the seller or seller's competitor, not included in G2 scores.

"All-in-One Platform That Helps Us Iterate Fast and Deploy with Confidence"

5/5

What do you like best about Databricks?

We use Databricks daily as our core data platform for building and running pipelines across a medallion architecture, from extracting data out of SAP and Arkieva all the way to reporting-ready datasets. The notebook experience is intuitive, the feature set is massive, and Asset Bundles have made our CI/CD story with Azure DevOps really solid. Integration with cloud services was smooth, and once things are set up they just work. The learning curve can be steep for newer team members, especially around things like Unity Catalog and DABs, and costs can creep up if you're not staying on top of cluster configurations. Support is decent and the docs are strong enough that we rarely need to open a ticket. Overall, it's a powerful platform that does a lot under one roof, and it's hard to imagine our data engineering workflow without it. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

The cost can creep up fast if you're not careful with cluster sizing and job configurations, so it takes some effort to keep things optimized. Also, the learning curve for newer team members can be steep, especially around things like Asset Bundles, Unity Catalog, and getting the CI/CD pieces wired up properly. Review collected by and hosted on G2.com.

What problems is Databricks solving and how is that benefiting you?

Databricks is solving the problem of having fragmented data spread across multiple systems like SAP and Arkieva by giving us one unified platform to extract, transform, and serve it all. That means our business teams get clean, reliable, reporting-ready data without us having to juggle a bunch of separate tools, and we can deploy and manage everything consistently across environments with confidence. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

We're glad to hear that Databricks has been instrumental in streamlining your data engineering workflow and providing a powerful platform for your needs. We appreciate your feedback on the learning curve and cost considerations, and we're continuously working to improve in these areas.

Dharun T.

Senior Data Engineer

Mid-Market (51-1000 emp.)

4/1/2026

"Streamlined, Collaborative Data Workflows with Powerful Performance"

5/5

What do you like best about Databricks?

What I like most about Databricks is how it streamlines the entire data workflow by bringing processing, analysis, and machine learning into one platform. The collaborative notebook environment makes it easy to share code, context, and reasoning with teammates, which helps everyone stay aligned. It also performs strongly on large datasets while abstracting away most of the cluster management, so I can focus on solving the problem rather than dealing with infrastructure. On top of that, centralized access control and clear visibility into data usage support responsible data governance, offering a solid balance between power and ease of use. Review collected by and hosted on G2.com.

What do you dislike about Databricks?

Databricks has a few downsides, although many of them feel more like trade-offs than outright negatives. My biggest concern is cost: if clusters aren’t managed carefully, expenses can climb quickly, even though the platform can scale very efficiently when it’s tuned properly. There’s also a real learning curve with Spark and distributed computing concepts, and debugging or performance tuning can be more involved than with simpler tools. Lastly, because it’s a managed service, you give up some low-level control compared with self-hosted systems, but the upside is that it takes a lot of the operational and infrastructure work off your plate. Review collected by and hosted on G2.com.

Response from Janelle Glover of Databricks

edit

We're glad to hear that you find Databricks to be a powerful and streamlined platform for collaborative data workflows. We understand the concerns about cost management and the learning curve associated with distributed computing concepts. We continuously work to improve our platform and provide resources to help users optimize their usage and overcome challenges.